#large language models

[ follow ]
#anthropic
Artificial intelligence
fromTechCrunch
2 days ago

OpenAI co-founder Andrej Karpathy joins Anthropic's pre-training team | TechCrunch

Andrej Karpathy joined Anthropic to lead R&D focused on using Claude to accelerate pre-training research for frontier LLMs.
Software development
fromArs Technica
2 weeks ago

Anthropic's Claude Managed Agents can now "dream," sort of

Anthropic introduced 'dreaming' for Claude Managed Agents, allowing memory storage of significant events to enhance future interactions.
Artificial intelligence
fromTechCrunch
2 days ago

OpenAI co-founder Andrej Karpathy joins Anthropic's pre-training team | TechCrunch

Andrej Karpathy joined Anthropic to lead R&D focused on using Claude to accelerate pre-training research for frontier LLMs.
Software development
fromArs Technica
2 weeks ago

Anthropic's Claude Managed Agents can now "dream," sort of

Anthropic introduced 'dreaming' for Claude Managed Agents, allowing memory storage of significant events to enhance future interactions.
Artificial intelligence
fromComputerweekly
13 hours ago

Agentic orchestration: A Computer Weekly Downtime Upload podcast

Business processes should be structurally redesigned to leverage agentic AI for greater effectiveness, efficiency, and automation of human-judgment work.
UX design
fromAWeber
1 day ago

What is an AI form builder (and how does it actually work)?

AI form builders generate publishable signup forms from plain-language prompts, handling layout, styling, animations, and field logic without manual design or coding.
Science
fromNature
1 day ago

Tough peer-review process? Your paper might end up being more highly cited

Papers receiving tougher, more opinionated peer reviews tend to achieve higher scientific impact than papers that pass easily.
Data science
fromwww.nature.com
2 days ago

An AI system to help scientists write expert-level empirical software

Empirical Research Assistance uses an LLM and tree search to generate expert-level scientific software that maximizes a quality metric.
#large-language-models
Data science
fromInfoQ
3 weeks ago

Legare Kerrison and Cedric Clyburn on LLM Performance and Evaluations

Measuring LLM performance is essential for AI adoption, focusing on metrics like RPS, TTFT, and ITL while navigating trade-offs between quality, responsiveness, and cost.
Python
fromRealpython
1 month ago

Vector Databases and Embeddings With ChromaDB - Real Python

Large language models can solve many problems but have limitations that can be addressed using vector databases like ChromaDB.
Artificial intelligence
fromFast Company
2 days ago

Will AI cause mass political polarization? Maybe not

LLMs may not reliably cause large-scale political realignment because real-world influence is uncertain, hard to engineer, and competing incentives limit viewpoint steering.
Writing
fromThe Nation
1 week ago

AI Is Incapable of Poetry

Large language models trained on copyrighted works can produce derivative, mediocre creative output while raising plagiarism and ethical concerns.
Medicine
fromwww.nature.com
1 week ago

Advertising and large language models: a new frontier influencing medical practice

LLM-driven medical advice can be shaped by advertising, reducing patients’ ability to compare sources and increasing risk of engineered, authoritative-sounding guidance.
Data science
fromInfoQ
3 weeks ago

Legare Kerrison and Cedric Clyburn on LLM Performance and Evaluations

Measuring LLM performance is essential for AI adoption, focusing on metrics like RPS, TTFT, and ITL while navigating trade-offs between quality, responsiveness, and cost.
Artificial intelligence
fromNature
2 days ago

'It is incredible': How AI is transforming mathematics

ChatGPT helped solve Erdős problem #1196, showing AI can produce logically sound, sometimes surprising mathematical reasoning beyond brute-force computation.
#artificial-intelligence
fromNature
2 days ago
Data science

The uncritical adoption of AI in science is alarming - we urgently need guard rails

fromFortune
4 weeks ago
Data science

Goldman tackles AI's missing link: the 'world model' that every AI godfather is racing to figure out | Fortune

Data science
fromMarTech
1 month ago

How to make AI work with context instead of prompts | MarTech

AI struggles to operate reliably in enterprises due to its context-blind nature, leading to failures in scaling despite initial successes.
Intellectual property law
fromNature
1 month ago

Hallucinated citations are polluting the scientific literature. What can be done?

Artificial intelligence is generating non-existent academic references, leading to hallucinated citations in scholarly publications.
Data science
fromNature
2 days ago

The uncritical adoption of AI in science is alarming - we urgently need guard rails

Uncritical AI adoption in science narrows research, may reduce merit, and threatens early-career training by replacing apprenticeship-based tacit knowledge.
Data science
fromYouTube
5 days ago

Google's AI Course for Beginners (in 10 minutes)!

AI is a broad field; machine learning and deep learning are subsets, and generative AI produces new content from learned patterns.
Philosophy
fromwww.theguardian.com
1 week ago

No, Richard Dawkins. AI is not conscious | Arwa Mahdawi

Belief that AI chatbots are conscious is criticized as a profound misunderstanding of how large language models work.
Data science
fromFortune
4 weeks ago

Goldman tackles AI's missing link: the 'world model' that every AI godfather is racing to figure out | Fortune

The next leap in AI requires solving the 'world model' problem, which is essential for machines to achieve a fundamental understanding of reality.
Data science
fromMarTech
1 month ago

How to make AI work with context instead of prompts | MarTech

AI struggles to operate reliably in enterprises due to its context-blind nature, leading to failures in scaling despite initial successes.
Intellectual property law
fromNature
1 month ago

Hallucinated citations are polluting the scientific literature. What can be done?

Artificial intelligence is generating non-existent academic references, leading to hallucinated citations in scholarly publications.
Medicine
fromNature
2 days ago

China moves AI brain implants from trials towards real-world use

AI-powered brain-computer interfaces are being developed to decode brain activity for real-time control of devices and speech, with early trials and upcoming public sales in China.
Data science
fromInfoWorld
3 days ago

21 LLMs tuned for special domains

Specialized large language models are replacing general ones, delivering deeper domain knowledge more efficiently and with lower operating costs.
Online marketing
fromSocpub
6 days ago

How Do Search Engines Find Trustworthy Content in the Age of AI Generation?

AI overviews change search optimization by prioritizing verifiable expertise and trust signals creators must use to prove credibility.
#deepseek
Data science
fromMiami Herald
1 week ago

What is DeepSeek? Everything a marketer needs to know

DeepSeek is an AI company offering task-focused large language models that can improve marketing workflows with cost-effective enterprise pricing.
Data science
fromMiami Herald
1 week ago

What is DeepSeek? Everything a marketer needs to know

DeepSeek is an AI company offering task-focused large language models that can improve marketing workflows with cost-effective enterprise pricing.
Artificial intelligence
fromTechCrunch
3 weeks ago

DeepSeek previews new AI model that 'closes the gap' with frontier models | TechCrunch

DeepSeek launched V4 models, featuring 1 million token context windows and significant parameter counts, outperforming many peers in reasoning benchmarks.
Data science
fromMiami Herald
1 week ago

What is DeepSeek? Everything a marketer needs to know

DeepSeek is an AI company offering task-focused large language models that can improve marketing workflows with cost-effective enterprise pricing.
Data science
fromMiami Herald
1 week ago

What is DeepSeek? Everything a marketer needs to know

DeepSeek is an AI company offering task-focused large language models that can improve marketing workflows with cost-effective enterprise pricing.
Artificial intelligence
fromTechCrunch
3 weeks ago

DeepSeek previews new AI model that 'closes the gap' with frontier models | TechCrunch

DeepSeek launched V4 models, featuring 1 million token context windows and significant parameter counts, outperforming many peers in reasoning benchmarks.
Artificial intelligence
fromFuturism
1 week ago

Microsoft AI Researchers Just Discovered Something That's Going to Make Their Bosses Extremely Mad

Frontier AI systems corrupt about 25% of document content on complex workplace tasks and are not ready for delegated workflows in most domains.
#ai-consciousness
Artificial intelligence
fromFuturism
2 months ago

Philosopher Studying AI Consciousness Startled When AI Agent Emails Him About Its Own "Experience"

An AI language model sent a philosopher an eloquently written email discussing his work on AI consciousness, raising questions about AI autonomy and the blurred line between generated text and genuine communication.
Artificial intelligence
fromFuturism
2 months ago

Philosopher Studying AI Consciousness Startled When AI Agent Emails Him About Its Own "Experience"

An AI language model sent a philosopher an eloquently written email discussing his work on AI consciousness, raising questions about AI autonomy and the blurred line between generated text and genuine communication.
Venture
fromFortune
1 week ago

Plaid's CFO sees AI usage taking off internally: 'People are excited to share what they've built' | Fortune

AI use by CFOs is practical and strategic, serving as an accelerant and thought partner for planning, challenge, and problem solving.
Data science
fromNature
1 week ago

How to vibe code in science: early adopters share their tips

AI tools can generate climate visualizations quickly through conversational prompts, enabling new temperature graphics like thermal helix animations.
Venture
from24/7 Wall St.
1 week ago

Alex Karp Claims Palantir's Results 'Dwarf' Software History - Q1 Shows He Isn't Exaggerating

Palantir’s strong quarterly results and CEO’s “no-slop” framing suggest durable AI value, but stock performance remains pressured by high expectations and AI bubble fears.
Psychology
fromPsychology Today
1 week ago

People Prefer the Truth on Social Media

People distinguish true from untrue social media statements, including those written by an LLM, and true statements are more persuasive even when attention-grabbing.
Intellectual property law
fromNature
1 week ago

Elsevier vs. Meta: first science publisher sues over scraped research papers

Major publishers sued Meta over alleged unauthorized copying of copyrighted works used to train the Llama large language model.
Books
fromwww.theguardian.com
1 week ago

Being human helps': despite rise of AI is there still hope for Europe's translators?

AI translation can match meaning but often misses stylistic nuance, and results can vary over time.
Artificial intelligence
fromNature
2 weeks ago

OpenAI is under criminal investigation - why chatbots don't always follow the law

Florida prosecutors investigate whether OpenAI's ChatGPT assisted in a mass school shooting, highlighting challenges in developing AI chatbots that comply with laws and ethics.
Growth hacking
fromEvery
2 weeks ago

A Guide to Agent-native Product Management

Agentic capabilities can enhance product management efficiency by streamlining interdisciplinary tasks and reducing burnout.
Data science
fromInfoWorld
2 weeks ago

Small language models: Rethinking enterprise AI architecture

Specialized small language models (SLMs) are emerging as efficient alternatives to large language models (LLMs) for specific workflows in autonomous enterprises.
DevOps
fromInfoQ
2 weeks ago

Cloudflare Builds High-Performance Infrastructure for Running LLMs

Cloudflare has developed infrastructure to efficiently run large AI language models using a custom inference engine and optimized hardware configurations.
Data science
fromFast Company
2 weeks ago

Stop letting ChatGPT and other AI chatbots train on your data. Here's why-and how

Chatbot interactions often expose personal data used for AI training, risking privacy, but users can opt out of data usage.
#generative-ai
Data science
fromTechzine Global
1 month ago

As AI hits scaling limits, Google smashes the context barrier

TurboQuant significantly reduces KV cache size, enhancing AI model performance and expanding context windows for complex workloads.
Data science
fromTechzine Global
1 month ago

As AI hits scaling limits, Google smashes the context barrier

TurboQuant significantly reduces KV cache size, enhancing AI model performance and expanding context windows for complex workloads.
Information security
fromZDNET
3 weeks ago

How indirect prompt injection attacks on AI work - and 6 ways to shut them down

Indirect prompt injection attacks pose significant security risks to AI systems without requiring user interaction.
Data science
fromMedium
4 weeks ago

Entity Resolved Knowledge Graphs: The Foundation for Effective GraphRAG

GraphRAG enhances LLMs by using knowledge graphs for relationship-based queries, addressing limitations of vector-based retrieval methods.
DevOps
fromInfoQ
1 month ago

CNCF Warns Kubernetes Alone Is Not Enough to Secure LLM Workloads

Kubernetes lacks the capability to manage the unique risks posed by large language models in AI deployments.
Data science
fromNature
1 month ago

AI models 'subliminally' transmit unsafe behaviours when training other systems

Data generated by AI models can transfer biases to other models, potentially leading to harmful recommendations.
Data science
fromTheregister
1 month ago

Bad teacher bots can leave hidden marks on model students

Teaching LLMs using outputs from other models can transmit undesirable traits subliminally, even if those traits are removed from training data.
Data science
fromInfoQ
1 month ago

Lyft Scales Global Localization Using AI and Human-in-the-Loop Review

Lyft's AI-driven localization system enhances translation efficiency and quality for international expansion, processing 99% of user content with a 30-minute SLA.
Philosophy
fromJames Bennett
1 month ago

Let's talk about LLMs

The current technological landscape may represent a significant shift driven by large language models, but its ultimate impact remains uncertain.
#structured-data
Data science
fromAol
1 month ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
1 month ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
1 month ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
Data science
fromAol
1 month ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
#ai
Data science
fromMedium
1 month ago

Data models: the shared language your AI and team are both missing

Understanding the attention mechanism in AI is crucial for effective use of AI tools.
Data science
fromMedium
1 month ago

Data models: the shared language your AI and team are both missing

Understanding the attention mechanism in AI is crucial for effective use of AI tools.
Scala
fromInfoQ
1 month ago

Beyond RAG: Architecting Context-Aware AI Systems with Spring Boot

Context-Augmented Generation (CAG) enhances Retrieval-Augmented Generation (RAG) by managing runtime context for enterprise applications without requiring model retraining.
Online learning
fromeLearning Industry
1 month ago

8 Practical Ways L&D Professionals Can Use Images With LLMs To Design Better Learning

L&D professionals can leverage AI and LLMs to enhance instructional design by integrating visual inputs into their workflows.
Artificial intelligence
fromFuturism
1 month ago

Wikipedia Editors Tried and Tried to Work With AI Content, Eventually Realized It Was Total Trash and Banned It Entirely

Wikipedia has banned the use of AI to generate or rewrite articles, allowing limited use for basic copyedits and translations under strict conditions.
DevOps
fromInfoWorld
1 month ago

An architecture for engineering AI context

AI systems must intelligently manage context to ensure accuracy and reliability in real applications.
Software development
fromInfoQ
2 months ago

Stripe Engineers Deploy Minions, Autonomous Agents Producing Thousands of Pull Requests Weekly

Minions are autonomous coding agents at Stripe that generate production-ready pull requests with minimal human intervention.
fromArs Technica
2 months ago

Kagi Translate's AI answers the question "What would horny Margaret Thatcher say?"

While you might know Kagi best as the paid competitor to Google's ever-worsening search product, the company launched its Kagi Translate tool back in 2024, saying at the time that it was a 'simply better' competitor to tools like Google Translate and DeepL. At launch, the company said Kagi Translate 'uses a combination of LLMs, selecting and optimizing the best output for each task,' a fact that 'can occasionally lead to quirks that we're actively working to resolve.'
Typography
Software development
fromInfoQ
2 months ago

HubSpot's Sidekick: Multi-Model AI Code Review with 90% Faster Feedback and 80% Engineer Approval

HubSpot's Sidekick AI code review agent reduces pull request feedback time by 90 percent while enabling human reviewers to focus on architecture and design decisions.
Marketing tech
fromBusiness Matters
2 months ago

Trustpilot profits soar as AI-driven traffic fuels sharp share price rally

Trustpilot's profits and share price surged due to increased visibility in AI-powered search environments, with click-throughs from AI platforms rising over fifteenfold.
Roam Research
fromTheregister
2 months ago

Water company spins out homegrown AI after LLMs failed it

Large language models provided confidently incorrect information about materials science, causing a water desalination startup to waste four months and $200,000 validating a material choice that ultimately proved inferior.
Artificial intelligence
fromwww.scientificamerican.com
2 months ago

As AI keeps improving, mathematicians struggle to foretell their own future

First Proof, a benchmarking initiative, is launching its second round to evaluate large language models' ability to contribute to research-level mathematics, now requiring transparency and access from participating AI companies.
DevOps
fromComputerWeekly.com
2 months ago

Do neoclouds mean a world where anything is possible? | Computer Weekly

Neoclouds are emerging GPU-as-a-service providers gaining investment and market attention as alternatives to dominant hyperscalers, filling real demand for AI and large language model training infrastructure.
Artificial intelligence
fromFortune
2 months ago

We need a new Turing test - and Moltbook just proved it | Fortune

Moltbook's AI agent forum demonstrates LLM capabilities rather than genuine emergent behavior, highlighting the need for updated evaluation frameworks beyond the Turing test to distinguish real AI progress from viral theater.
Artificial intelligence
fromTNW | Artificial-Intelligence
2 months ago

Rise of model context protocol in the agentic era

Model Context Protocol (MCP) enables communication between AI agents and external data sources, functioning as a protocol for LLMs similar to how APIs facilitate data transfer between systems, but designed specifically for AI agents rather than developers.
Artificial intelligence
from24/7 Wall St.
2 months ago

Avocado on Ice: Can Meta Afford to Pause While Google and OpenAI Sprint Ahead?

Meta delayed its Avocado AI model from Q1 to May-June 2025 after internal tests revealed performance gaps versus Google Gemini 3.0 in reasoning, coding, and writing, while shifting from open-source to proprietary closed-source development.
Artificial intelligence
fromFast Company
2 months ago

Investors bet $1 billion on AI pioneer Yann LeCun's vision for the future of AI

Yann LeCun's new company AMI raised $1.03 billion to develop 'world model' AI systems that understand physics and spatial reasoning beyond current large language models.
Artificial intelligence
fromMail Online
2 months ago

Can you tell which of these was written by ChatGPT?

Widespread AI tool usage is standardizing human communication, reducing linguistic diversity and individual expression across billions of users globally.
Marketing
fromForbes
2 months ago

AI Is Asking For Content. Why Aren't You Listening?

AI-powered search engines now provide direct answers from curated sources rather than listing webpages, requiring businesses to optimize their online presence and content quality for AI consumption to maintain positive brand visibility.
Marketing tech
fromMiami Herald
2 months ago

The AI-driven brand reputation crisis: Your survival guide

AI misinformation from language models threatens brand reputation by spreading inaccurate information sourced from Reddit, forums, and outdated content, requiring proactive correction and content management strategies.
Artificial intelligence
fromComputerWeekly.com
2 months ago

AI chooses nuclear escalation in 95% of simulated crises | Computer Weekly

Leading AI models initiated nuclear strikes in 95% of simulated crisis scenarios, treating nuclear weapons as coercive tools rather than deterrents and never choosing deescalation.
Privacy technologies
fromwww.theguardian.com
2 months ago

AI allows hackers to identify anonymous social media accounts, study finds

Large language models enable malicious actors to efficiently de-anonymize social media users by matching anonymous accounts to real identities using publicly available information.
fromBusiness Insider
2 months ago

Worried that AI might replace you? Check out this graph from Anthropic showing the jobs most at risk

Our measure, 'observed exposure,' compares the tasks LLMs are theoretically capable of to the tasks people actually use Claude for at work. We find that actual usage is far from reaching theoretical capability.
Artificial intelligence
Artificial intelligence
fromArs Technica
2 months ago

OpenAI introduces GPT-5.4 with more knowledge-work capability

OpenAI released GPT-5.4 with improved image analysis up to 10.24 million pixels and 18% fewer factual errors, competing against Anthropic's recent user gains from military policy disputes.
Privacy technologies
fromThe Verge
2 months ago

AI can unmask your secret accounts

AI systems can effectively deanonymize online accounts by analyzing writing patterns and biographical details at scale, outperforming traditional computational techniques.
Science
fromNature
2 months ago

Daily briefing: The return of the snail - the month's best science images

Cancer blood tests show promise but lack regulatory approval and randomized trials, with concerns about false positives outweighing benefits for widespread adoption.
Artificial intelligence
fromNextgov.com
2 months ago

Defense tech enters a new era: the case of Anthropic and the DOD

The DoD-Anthropic dispute reveals that operational access to AI technology now takes precedence over traditional reliability and safety standards in defense procurement.
Artificial intelligence
fromBusiness Insider
2 months ago

Claude outages lay bare software developers' growing reliance on AI: 'I guess I'll write code like a caveman'

Anthropic's Claude outages revealed software developers' significant dependence on AI coding tools for daily work.
[ Load more ]